Semi-supervised classification learning by discrimination-aware manifold regularization

Authors

  • Yunyun Wang
  • Songcan Chen
  • Hui Xue
  • Zhenyong Fu
Abstract

Manifold regularization (MR) provides a powerful framework for semi-supervised classification (SSC) using both labeled and unlabeled data. It first constructs a single Laplacian graph over the whole dataset to represent the manifold structure, and then enforces a smoothness constraint over that graph through a Laplacian regularizer during learning. However, smoothness over a single Laplacian graph risks ignoring the discrimination among boundary instances, which are very likely to come from different classes even though they lie close to each other on the manifold. To compensate for this deficiency, prior work has taken discrimination into account together with smoothness in learning. However, those works are confined to the discrimination of the labeled instances, and are thus rather limited in boosting semi-supervised learning. To mitigate this limitation, we first attempt to discover the possible discrimination in the available instances by performing unsupervised clustering over the whole dataset, and then incorporate it into MR to develop a novel discrimination-aware manifold regularization (DAMR) framework. In DAMR, instances with high similarity on the manifold are restricted to share the same class label if they belong to the same cluster, and to have different class labels otherwise. Our empirical results show the competitiveness of DAMR compared to MR and its variants that likewise incorporate discrimination in learning. © 2014 Elsevier B.V. All rights reserved.
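
The contrast the abstract draws between plain graph smoothness and a cluster-aware constraint can be illustrated with a small sketch. This is not the paper's formulation: the abstract gives no equations, so the penalty forms below are assumptions chosen for illustration. They are `(f_i - f_j)^2` for neighbours in the same cluster and `(f_i + f_j)^2` for neighbours in different clusters. The function names, the Gaussian bandwidth `sigma`, the neighbourhood size `k`, and the hand-supplied cluster assignments (standing in for the output of some unsupervised clustering step) are likewise hypothetical.

```python
import math

def gaussian_weight(a, b, sigma=1.0):
    """Similarity between two points; sigma is an assumed bandwidth."""
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-d2 / (2.0 * sigma ** 2))

def knn_edges(points, k=2, sigma=1.0):
    """Return {(i, j): weight} with i < j for a symmetrised k-NN graph."""
    edges = {}
    for i, p in enumerate(points):
        others = [j for j in range(len(points)) if j != i]
        others.sort(key=lambda j: gaussian_weight(p, points[j], sigma),
                    reverse=True)
        for j in others[:k]:
            edges[(min(i, j), max(i, j))] = gaussian_weight(p, points[j], sigma)
    return edges

def smoothness_penalty(edges, f):
    """Plain graph smoothness: every edge pulls its endpoints together."""
    return sum(w * (f[i] - f[j]) ** 2 for (i, j), w in edges.items())

def discrimination_aware_penalty(edges, f, clusters):
    """Assumed cluster-aware variant (illustrative, not the paper's objective).

    Same-cluster neighbours are pushed towards equal outputs; neighbours in
    different clusters are pushed towards outputs of opposite sign.
    """
    total = 0.0
    for (i, j), w in edges.items():
        if clusters[i] == clusters[j]:
            total += w * (f[i] - f[j]) ** 2
        else:
            total += w * (f[i] + f[j]) ** 2
    return total

# Toy 1-D data: two groups that touch at the boundary between 0.4 and 0.6.
points = [(0.0,), (0.2,), (0.4,), (0.6,), (0.8,), (1.0,)]
clusters = [0, 0, 0, 1, 1, 1]            # assumed output of a clustering step
edges = knn_edges(points, k=2)

f_split = [1.0, 1.0, 1.0, -1.0, -1.0, -1.0]   # labels follow the clusters
f_const = [1.0] * 6                            # one label for everything

print("plain smoothness,  split:", smoothness_penalty(edges, f_split))
print("plain smoothness,  const:", smoothness_penalty(edges, f_const))
print("cluster-aware,     split:", discrimination_aware_penalty(edges, f_split, clusters))
print("cluster-aware,     const:", discrimination_aware_penalty(edges, f_const, clusters))
```

On this toy data the plain smoothness term charges the labeling that separates the two groups, because the boundary points are graph neighbours, while the cluster-aware term charges the labeling that merges them instead. In a full learner, either term would be added to a supervised loss on the labeled instances.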

Similar resources

Enhancement of ELM by Clustering Discrimination Manifold Regularization and Multiobjective FOA for Semisupervised Classification

A novel semisupervised extreme learning machine (ELM) with a clustering discrimination manifold regularization (CDMR) framework, named CDMR-ELM, is proposed for semisupervised classification. Using an unsupervised fuzzy clustering method, the CDMR framework integrates the clustering discrimination of both labeled and unlabeled data with twinning constraints regularization. Aiming at further improving the c...

A Semi-supervised Method for Multimodal Classification of Consumer Videos

In large databases, the lack of labeled training data leads to major difficulties in classification. Semi-supervised algorithms are employed to address this problem. Video databases epitomize such a scenario. Fortunately, graph-based methods have been shown to form promising platforms for semi-supervised video classification. Based on multimodal characteristics of video data, different fe...

Semi-supervised Collaborative Text Classification

Most text categorization methods require text content of documents that is often difficult to obtain. We consider “Collaborative Text Categorization”, where each document is represented by the feedback from a large number of users. Our study focuses on the semisupervised case in which one key challenge is that a significant number of users have not rated any labeled document. To address this pr...

Manifold Regularized Discriminative Neural Networks

Unregularized deep neural networks (DNNs) can easily overfit with a limited sample size. We argue that this is mostly due to the discriminative nature of DNNs, which directly model the conditional probability (or score) of labels given the input. Ignoring the input distribution makes it difficult for DNNs to generalize to unseen data. Recent advances in regularization techniques, such as pretrain...

Linear Manifold Regularization for Large Scale Semi-supervised Learning

The enormous wealth of unlabeled data in many applications of machine learning is beginning to pose challenges to the designers of semi-supervised learning methods. We are interested in developing linear classification algorithms to efficiently learn from massive partially labeled datasets. In this paper, we propose Linear Laplacian Support Vector Machines and Linear Laplacian Regularized Least...

Journal:
  • Neurocomputing

Volume 147

Publication date: 2015